IN THE SPECIFICATION: 

Please revise the specification as follows: 

Please amend paragraphs number 72 and 73 as follows: 
[0072] Referring again to Fig. 6, document 1002 contains a plurality of outbound 
links 1010. Each outbound link 1010 points to a target document address, typically the URL 
of a target document. For example, link 1010-1 points to the URL of target document 1012- 
1. Links 1010 are typically contained within a region 1011 of document 1002 known as an 
"anchor tag." The structure and function of anchor tags are well-known to those of skill in 
the art of hypertext markup language (HTML) composition and interpretation. Amongst 
other features of anchor tag 1011, anchor tag 1011 may include anchor text. Anchor 
text is contained in document 1002 near the URL associated with link 1010. Typically, the 
anchor text in anchor tag 1011 is delimited by the opening and closing markup tags "<a>" 
and "</a>," respectively. 

[0073] The anchor text in anchor tag 1011 may contain useful information about 
document 1012-1. For example, the anchor text may include the statement "this is an 
interesting website about cats." If document 1012-1 is unavailable for retrieval at the time 
crawling of collection 1000 is performed, this anchor text provides textual information that 
can be searched by keyword. Document 1012-1 may be unavailable for crawling because the 
server on which it is hosted is not operational at the time of crawling, the server on which it is 
hosted challenges the robot for a password, or any number of other reasons. Additionally, 
document 1012-1 may be an image file, a video file, or an audio file, in which case there is no 
textual information readily available from the contents of document 1012-1. So, if the text 
from anchor tag 1011 is indexed as part of the indexing of document 1012-1, a user who 
submits a query containing the term "cat" may receive a list of documents including 
document 1012-1. Another advantage of indexing the anchor text from anchor tag 1011 
together with document 1012-1 occurs in cases where document 1002 contains more accurate 
information about document 1012-1 than the textual contents of document 1012-1 itself. For 
example, document 1002 may be a relatively authoritative web page that contains text near or 
in an anchor tag associated with link 1010-1 stating that "the server that hosts web page 
1012-1 is frequently unavailable." Page 1012-1 may contain no text indicating that it is 
frequently unavailable. If page 1012-1 is successfully crawled [[an]] and indexed, a user of a 
search engine employing the index will have no way to learn of the potential unavailability of 
page 1012-1 unless information from page 1002 is returned in response to a query. 
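
By way of illustration only, the indexing of anchor text together with a target document, as described in paragraph [0073], might be sketched as follows. This is a minimal sketch and not the disclosed implementation. The names AnchorExtractor and index_anchor_text, the example URL, and the simple whitespace tokenization are all assumptions introduced for the example. No stemming is performed, so the example query uses the term "cats" exactly as it appears in the anchor text.

```python
from collections import defaultdict
from html.parser import HTMLParser


class AnchorExtractor(HTMLParser):
    """Hypothetical helper: collects (target URL, anchor text) pairs
    from the anchor tags of one source document."""

    def __init__(self):
        super().__init__()
        self.links = []      # list of (target_url, anchor_text)
        self._href = None    # URL of the anchor tag currently open, if any
        self._text = []      # text fragments seen inside that anchor tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only text between <a ...> and </a> counts as anchor text.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def index_anchor_text(source_html, index):
    """Add each anchor-text term to the index under the TARGET URL,
    so the target can be found even if it was never crawled."""
    parser = AnchorExtractor()
    parser.feed(source_html)
    parser.close()
    for target_url, text in parser.links:
        for raw_term in text.lower().split():
            term = raw_term.strip('.,;:!?"')  # assumed, simplistic cleanup
            if term:
                index[term].add(target_url)
    return index


# Example: the source document links to an image that has no text of its own.
source = ('<p>See <a href="http://example.com/cats.jpg">'
          'this is an interesting website about cats.</a></p>')
index = index_anchor_text(source, defaultdict(set))
print(sorted(index["cats"]))
```

In this sketch the terms are stored against the address of the target document rather than the address of the document containing the link, which is what allows a keyword query to return a target document that was unavailable for crawling or that contains no text.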